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BACKGROUND OF THE INVENTION 
[Field of the invention] 

5 The present invention relates to computerised qraDhic svstem<; anH in n *r+;~ t~ ^ . 

engine which provides for the manipulation of image daTa P d,SC '° SeS 3 graphics 

[Prior Art] 

10 Desktop publishing systems are known in the art which enable a user to ma nin..i ate • . • 

as to create or amend a visual image in some manner manipulate visual ,mage data so 

on j^rMsr: fz^rrs^sr ima9es which can be ~ 

and image quality on the other. However, for th ?2^^" nd *^° n * ,M8hand 

generally 24 times that of the black and wSTSSinS^rSS r '^'^ * PrOC6SSed ' S 
times are correspondingly greater. Accordingly, usmg ex,st,ng technologies, processing 

20 SUMMARY OF THE INVENTION 

nculL; Z^SnTTJJTS "i^^er^ 6, " ame ' i0rate thS ab — «°ned dif- 
image data J* to'ou.put the p^S ^ LV.S: r l^X^^^ IC ■ "V"* 0 *'' ° n C °'° Ur 
25 displays and colour printers suitable for reproduction us.ng existing visual 

and matt, combine,, wherein each of said colour interoo .tors f """W-M/ mterpol.lo, 

composers receive cole, Input dal, (or J£Z^T?£TV^? "mmann.. each of said 

inpir. d.,a in accordance with . pre-de,.!^,- SSlToXS ^^CSLSSSSH C °""'. 

^ r >- 9 «,o U ,,ma g edrrr„^ 

ation^r^rra^ 

output increases atarate equal to or in excess ofl 1 thLa.LnfcSf k ] ? in sa,d J um P ™de. said 
for said output to traverse between said TmiZum and said ZsZt ™^ d *"™** the «*•" 

BRIEF DESCRIPTION OF THE DRAWINGS 

A preferred embodiment of the present invention will now be described with reference to the drawings in 

g F ra S p h^engre; SChema,iC W °* °* P ™ ™** mani > ulati ° n which incorporate a 

S" 3 Is a SSHlr Jint J a9ram °I thS Preferred embodi ^ent of the graphics engine; 
Ng. 3 is a schema! c block d.agram of the render processor interface of Fig 2- 
F g. 4 is a schematic block diagram of the first-in-first-out register of Fig 3 ' 

F 2" 6 is a s^m^h^tH 9 Van '° US St3teS ° f read address Senerator of Fig. 4; 
Hg. 6 is a schematic block diagram of the control unit of Fig 2- 

Fig. 7 is a schematic block diagram of the run controller of Fiq V 

Fig. 8 is a schematic block diagram of the run length register of Fig T 
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Fig. 9 is a schematic block diagram of the run length counter of Fig. 

Fig. 10 is a schematic block diagram of the ramp generator of Fig. 7; Figs. 1 1 A and B illustrate the gener- 
ation of the ramp of the generator of Fig. 1 0; 

Fig. 12 is a schematic block diagram of one of the colour interpolators of Fig. 2; 

Fig. 13 is a schematic block diagram of the interpolator A of Fig. 12; 

Fig. 14 is a schematic block diagram of the interpolator B of Fig. 12; 

Fig. 15 is a schematic block diagram of one of the colour compositors of Fig. 2; 

Fig. 16 is a schematic block diagram of compositor A of Fig. 15; 

Fig. 17 is a schematic block diagram of the compositor B of Fig. 15; 

Fig. 18 is a schematic block diagram of the transparency interpolator of Fig. 2; 

Fig. 19 is a schematic block diagram of the transparency interpolator synchroniser of Fig. 18; 

Fig. 20 is a schematic block diagram of the transparency interpolator A of Fig. 18; 

Fig. 21 is a schematic block diagram of the transparency interpolator B of Fig. 18; 

Fig. 22 is a schematic block diagram of the matte combiner of Fig. 2; 

Fig. 23 is a schematic block diagram of the matte synchroniser of Fig. 22; 

Fig. 24 is a schematic block diagram of the matte delay of Fig. 22; and 

Fig. 25 is a schematic block diagram of the matte calculator of Fig. 22. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

As seen in Fig. 1 A, a simple form of image manipulation system takes the form of a general purpose com- 
puter 1 , a graphics engine 1 0, a work screen memory 30 of typically 3 Mbytes maximum and a screen or monitor 
3 capable of high resolution. In this system images stored in the computer 1 are able to be displayed on the 
monitor 3 and manipulated via the graphics engine 10 and the keyboard (not illustrated) associated with th 
monitor 3. 

Another type of image manipulation system shown in Fig. 1 B is associated with a colour laser copier 5 which 
is preferably the CANON CLC500 COLOR LASER COPIER (Registered Trade Mark). The copier 5 includes a 
scanner 6 and printer 7 which are connected to a page store or memory 4. The page memory 4 is typically 96 

.M ^y tj.^ ^n tLne^A3 sj 

which is substantially the same as a standard video broadcast data rate of 1 3.5 M pixels/second. The computer 
1 and graphics engine 10 are substantially as before. 

As indicated in Fig. 1C ? it is possible to combine both the above systems by the provision of a data bus 26 
interconnecting the graphics engine 10, the work screen memory 30 and page store 4. With this configuration, 
a image read by the scanner 6 can be manipulated whilst displayed on the monitor 3 and then printed out using 
the printer 7. 

Fig. 1D illustrates a similar system save that a pan/zoom engine 9 is provided between the data bus 26 
and the work screen store 30 to further increase the nature of the image data manipulations able to be perfor- 
med. 

The system of Fig. 1E is concerned essentially with video images which can be either still video or lines 
of a live video for broadcast or other purposes. Here the graphics engine 10 provides an output (only) to a line 
store 40 which operates on the first in first out (FIFO) principle and which in turn outputs to a triple digital to 
analogue converter (DAC) 41 to provide the necessary output video signal. 

Lastly, as seen in Fig. IF, a inter-active video image manipulation system incorporates the computer 1 and 
graphics engine 10 as before, however, the graphics engine 10 both outputs data to, and receives data from, 
a double bufferred frame store 42. The frame store is able to receive video data from a video source 43 and 
output video data to a video destination 44. 

With the above applications in mind, the preferred embodiment of the graphics engine 10 of the present 
invention will now be described with reference to Fig. 2 and with general reference to an image manipulation 
system of the general type depicted in Fig. 1D. 

It will be seen in Fig. 2 that the graphics engine 10 includes an input data bus 11, which provides a video 
input, and an output video data bus 19. Also included within the graphics engine 10 is a render processor (RP) * 
interface 100, a run length controller 200, and a control unit 300. A data path includes interpolators 400 for each 
of red, green, and blue, a transparency interpolator 600 as well as compositors 500 for each of red, green, and 
blue and a matte combiner 700. An address generator 800 and a write controller 900 are also provided. 

Commands enter the graphics engine 10 via the RP interface 100 and both the bus 12 which provides RP 
data and the bus 13 which provides RP control. Signal lines 14 provide for direct memory access (DMA) control. 
The control unit 300 accepts two clock signals 15 which include a pixel clock running typically at 13.35 MHz. 
The control unit 300 reads data from the RP interface 100, interprets those commands received, and passes 
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the command operands to the run controller 200, and to the data oath inn,.*;™ ■ » 

operands are also passed to the address generator 80? mS^^IS^XSS 0 ^T^SSfT^* - * 0 **- Tfw " «««~««*- 
also responsible for decoding microcode from me Se Commands Thth !f ^ Th6 COntrol Unit 300 is 
execution devices within the engine 1 0. commands, wh.ch ,s subsequently used by respective 

The run controller 200 is initialised with the run length of the Dartieuiar nn „i,- 
unit 300 and produces a linearly increasing number in Srm «f ™ / 9 ? ,M command bv *e control 
run. This ramp is used by the interpolators ^400 to cont ml ntZ P ° m ° l ° 255 f ° r the duration <* the 

The graphics engine" 0 is configuredS ^^^^^^ ° 0mMn,to - 
is an industoy standard. The colour data path S red "TeZZ kT' ? (RGBM) infon ™tion which 

a blend command, is initialised by the contro, un ifabc S r2^ n ^ PO ? W " 4 ° 0 ' 9eneral Case of 
path produces a blend between these two 2 ours anwll 8nd 30 6nd COlour The c °'<»" data 

the current co.our. in the pixe. ^^^"1^^^ the resu., including a percentage of 
transparency data path includes a transparent ISiSSJeSSSl SST ' "t- RGBM ° Ut b " S 19 ' The 
of a transparency blend command, the Lsparenc SKS^^JT^T S, ' mP ' e 0386 
transparency and an end transparency. The transparent 1 1 T£ J* W,t 300 with a start 

^^^^^^ 

used by other execution units ma .nta.ns a p.pelme of status signals which contains the micro-code 

greela^^ J* three component colours, red. 

positor 500. The interpolators 400^ respond le for ZTa^Te ZZTT *" d a corn- 

are responsible for mixing the b.end with the pixel memorT ^^'^ 500 

The transparency portion of the data path also has an interoolator finn h.,f „„. 
positing to the matte plane, the green compositor 400 1 - use* Thl rn att J o k C f mpOS,tor - Whe " • com- 
compositors 500 with the mixing proportion which rtselei torn on. T 7 °° Pr ° V ' deS the co,our 

The matte output is also fed to the red composite 500 whS^I li I P<>SSMe ^Wency sources, 
to the matte p.ane. to be written to the "d pLne instead " ^ ^ M h ^ been written 

wil, nirbe 9 Sbed^™ T *" ^ * ™™ " "» -*» thereof 

previa™ ::,;r c r m s a c n o dT and r the execution — — 

command is comp.ete. the graphics instn.ctl s S.^^.^^.^i" d T l - the Previo - 
the address generator 800 proceed in oarallel anri em Tt »h r f . 9 he data patn 400/ 500 and 

the pixel clock PXCLK 15. A new pixll is th^eafter eXrf. T f ' minimUm ° f S ' X C,OCk P eriods « 
unless stalled by a peripheral «l«rK^2^^^ 0 V ^^- % the duratfo " of the instruction 
150 (Fig. 10 to be described hereafter) within thfmn conLlfJ ? nn 9 , ^ ,nstmction - a ra ™P generator 
0 to 255 and indicates to the controller 300 when tL tntn . PP 3 ' ine3riy increasi "9 number from 

controller 900 interfaces to the pixeTmernor^ 4 30 oMhefrl ? ' S ™ 3 " LAST " Si 9 nal - The «*• 

also interfaces to a further bus 20 s 3v^a the Jn/^ ^ V ' 3 bUS 21 " The write co °™^ 900 

track of when and where to read a^d S plxe.s e " 9ine 9 " nd WOrkscreen ™™<V 30. and keeps 

man^Ton^ 10- These are: compositing com- 

mon type of instruction given to the engSe "Sand Compositing commands are the most corn- 
operations in the page store 4 and 32 > l leen Tl lTlV™^ 1 1 ° M co,our "^ a r blending 
used to set up internal or external hardwire ^nnecZ f„ ? ^ Confi 9 ura «on commands are 
sation commands are used to keel Te graph^ eno ° 1 nT t0 ' XeCuti "9 ""^Positing commands. Synchroni- 
hardware events. Table 1 shows me thre^rvSs of ^tt S f ynchron,sed to intema ' ^^ware events or externa. 

The source and destination o Wend^oTplr mStrUCt,0nS a " d their res P ec fVe instruction codes, 
the workscreen 3 (WS). A^Zg y Te"! ^T^^^ 1 ^ C ° mP ° Sitin9 pafle —o^4 (CM) or 
compositing memory 4; compositinq^ * ° f ^P 05 ' 1 '^ commands: compositing within the 

ry composing w.th.n the workscreen memory 30; compositing fromthe compositing mem- 



BNSOOCID: <EP <M652S0A2J_> 



EP 0 465 250 A2 



ory 4 to the workscreen memory 30; and compositing from the workscreen memory 30 to the compositing mem- 
ory 4. During CM to CM compositing, one pixel is read and written in every pixel clock period (75 ns). Reads 
are performed in the first half of the pixel clock periods and writes in the second half. The graphics engine 10 
uses a i pipeline delay system in which data is transferred sequentially between various devices within the 
engine 10. Accordingly, there is a delay between reading the "old" value of a pixel, and writing the "new" value 
of the pixel. For this reason, the compositing memory 4 address changes between the first and second halves 
of the pixel clock period. 

Due to timing constraints in the compositing memory 4, there is one pixel clock period delay between issuing 
a read address, and reading the data at that address. This delay is compensated for by adjusting the position 
within the graphics engine 10 pipeline where the read data is received. The compositing memory 4 also has a 
one clock delay when writing pixel data, but this does not affect the internal operation of the graphics engine 
10. 

During WS to WS compositing, a maximum of one pixel is read or written in every pixel clock period (75 
ns). Reads are requested in one pixel clock period, and writes in another. Because of pipeline delays in the 
graphics engine 10, for the same reasons as described above, the workscreen memory 30 address changes 
between successive read and write requests. 

Due to timing constraints in the workscreen memory 30, there can be a delay between issuing a read 
request, and reading the data at that address. Handshake signals from the pan/zoom engine 9 indicate when 
the data is ready. This delay is compensated for by stalling the graphics engine 10 pipeline until the read data 
is received. The graphics engine 10 reads the data from the monitor or workscreen 3 in the clock period after 
the read acknowledge handshake signal is received from the pan/zoom engine 98e 8, in order to be compatible 
with the one clock delay in the compositing memory 4. Latches, (not illustrated and external to the graphics 
engine as earlier described), hold the data during this delay. The workscreen memory 30 can also have a delay 
when writing pixel data. Typically, the first pixel written will not be delayed, as the write request is "posted", 
and performed in subsequent clock periods. The pan/zoom ratio must be set to 1 :1 for WS to WS compositing, 
as will be understood by those skilled in the art. 

CM to WS compositing is used to transfer pixel data from the compositing memory 4 to the workscr n 
memory 30. During CM to WS compositing, a maximum of one pixel is read and written in every pixel clock 
period. Reads are requested from to§,^^J^sjtjn^^ the first half of the p ixel clock period. .Writes 

are requested from the workscreen memory 30 during the entire pixel clock period, but the address and data 
are only available in the second half of the pixel clock period. Separate addresses are maintained for the work- 
screen address and the compositing memory 4 address. 

Due to timing constraints in the compositing memory 3, there is a one pixel clock period delay between 
issuing a read address, and reading the data at that address. This delay is compensated for by adjusting the 
position within the graphics engine 10 pipeline where the read data is received. Due to timing constraints in 
the workscreen memory 30, there can be a delay between issuing the write request, and receiving the corre- 
sponding handshake signal from the pan/zoom engine 9 which indicates when the data has been accept d. 
This delay is compensated for by stalling the graphics engine pipeline until the handshake is received. The write 
address and data will be presented again in the second half of the next clock period until the handshake is 
received. Typically, the first pixel written will not be delayed, as the write request is posted, and performed in 
subsequent clock periods. 

CM to WS compositing can be used in conjunction with a reduction zoom ratio to reduce the size of the 
workscreen image. In a reduction mode, the pan/zoom engine 9 discards pixels from the graphics engine 10, 
and only writes every "NTH" pixel. This is transparent to the operation of the graphics engine 1 0, as the only 
difference it perceives is that writes appear to be completed quicker. 

CM to WS compositing can also be used in conjunction with a magnification zoom ratio to increase the 
size of the workscreen image. In expansion mode, the pan/zoom engine 9 reads a pixel from the graphics engine 
10, and writes the next "N x N" pixels with the same value. This is also transparent to the operation of the 
graphics engine 10, as the only perceived difference is that writes appear to take longer to complete. Note that 
only the first pixel address in a run is used by the pan/zoom engine 9. The graphics engine 1 0 generates addres- 
ses for subsequent pixels, but those addresses are not required by the current implementation of the pan/zoom 
engine 9. 

WS to CM compositing is used to transfer pixel data from the workscreen memory 30 to the compositing 
memory 4. During WS to CM compositing, a maximum of one pixel is read and written in every pixel clock period. 
Reads are requested from the workscreen memory 30 in the first half of the pixel clock period and writes are 
performed on the compositing memory 4 in the second half of the pixel clock period. If the workscreen read 
has not been acknowledged by the end of the pixel clock period, the pipeline will be stalled. The read request 
will remain pending, and the write to the compositing memory 4 will be repeated at the same address. Separate 
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addresses are maintained for the workscreen memory 30 addresses and tha ••■ 
Due to timing constraints in the workscreen rnXj^3^ 

request, and reading the data at that address. Again handshake sfanah^ih. * ? " 9 3 read 

when the data is ready. This delay is compensated for bv STlfnc ?S «™ 1 wnftoom engine 9 indicate 
5 data is received. The compositing mm^^^SS^i^-T'T 1 ° Unti ' *• fead 

not affect the initial operation of L graphics engine 1 0 * " ^ ^ but does 

WS to CM compositing can be used in conjunction with a redurtinn t« •„ . 

as it is copied into the compositing memory 4 In Ws irode £S n T IOto i?^ the s.ze of the image 

address during the first pixei in a run. but or^y interne nte thS address XT ^'V '"'"f" 363 3 wort «*reen 
» the graphics engine 10. This has the effect of landing fach ptTbuUs ^Zl T reqU6StS fr ° m 

g^PhicsenginelO.Notematon.ythefi^^ 

eng^ogeneratesaddressesforsubse^ 

engine °" M "* * UMd h with a " ~ pa ™" zoom ratio in the pan/zoom 
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enables" to the R.G.B and M planes of the wo*™? oA 3 to 0 of tne Parameter configure "writ 

writes to individual i«J^r^^^^ n 3 ?^^ The " write -^.es provide a way to disab.e 
bits, .mage and matte write enablestgrT^ 

Bits 6 through 0 are used to allow daL whS £ ,? Sab ' 6 W^,teS • but not to '"dividual planes, 

plane. Bits 10 through a^sltTcJltr * ■"""^ *° the P ' ane ' to be written to the red 

"bit map" instructions are used ^Z^S^-'T^T 9 * ^ SP6d ' UP ^ circu ^tances. When 
"skip bLatte read? •Z^^^^ZSZ'^T ^ ^ ^ ^ SBR " 6nableS the 

bit in the bit map mask ^£ZT5&£Z 71 s * re « uested from workscreen If the appropriat 
consume only one pixel clock period TtZ^JSZS' ^ " '* " Skipped " the pixel wi » 

the panlzoom engine 9 Text typical^ cc^nal °! * CUfrent «>"P'ementaHon of 

40 are substantial. * P * 3 ' ar96 percenta 9 e <* "while space", so the potential time savings 

appr^™ 

to allow the skip bit matte read or wr te ^^ ,1 r P V " 9 the SCreen P ' ane wite enables - ln °^er 
/zoom engine 9 it is ^ ** ^ implementa «- <* ** Pan- 

45 write cycle to override the oanXnnm J composing memory 4 control bus 21 in every read or 

the start on ^^^S!^^!^^ ™« SER and SEW bTts assert 

command should be set to one * t ' 0n - The mn le " 9th " field of the ,oad salt configuration 

so used in conjunction with the " execute lcr 0 c!S T V m " truct,on is snown in Tabla <■ This command is 
instruction specify which inLSn^T^^^S^ ^JT^ Bte 27 thrOU9h 0 of the 
into the specific register More than ^r^Lt r ? t , ! parameterfoll °wing the instruction is then loaded 
3. as defined in table 4 can bl oaSedTSll! 3 ," 1°^ ^ the Parameter For examp,e ' re 9 istere 0 * 
ameter. Typically, the "alt ^"T^ ■*» from the par " 

55 is issued. reg.ster would also be loaded, before the "execute microcode" instruction 

command.^^^ -ater described) cannot be loaded using the toad register 

The graphics pipeline ^S^^£^^JSl T*"? ^ ^s^' be »° aded - 

se reg,sters instead of the usual registers when the "execute microcode" instruc- 
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tion is executed. Table 5 provides a description of various registers provided within the graphics engine 10. 

Synchronisation commands are used to maintain the graphics engine 1 0 synchronised to internal and exter- 
nal hardware events. There are two synchronisation commands, namely no operation (NOP) and WAIT. 

The NOP instruction has no effect upon the graphics pipeline other than to consume one word and one 
clock cycle. It is intended to be used at the end of an instruction stream so that the length of the stream is a 
multiple of four 32-bit words. This ensures that the DMA controller, provided on bus 14 in Fig. 2, and which 
transfers data in four word bursts, will always send the last command. 

The WAIT instruction prevents the graphics engine 10 from executing new instructions until one of three 
events occur. Those events are: a hardware restart signal is received, a software restart is received, or the 
graphics pipeline becomes empty. Options to the instructions select which of the events will restart instruction 
execution. Pixels from the previous instruction which are still in the graphics pipeline, are not effected by this 
instruction. 

The hardware restart option allows the graphics engine 10 instruction execution to be synchronised to 
external "real-time" events. This is required when performing a "test scan" where the graphics engine 1 0 is used 
to copy an image from the compositing memory 4 to the workscreen 3, while the scanner 6 is writing to the 
page memory 4 simultaneously. The graphics engine 10 and the scanner 6 operate on opposite halves of the 
page memory 4, and swap over every eight lines. The instruction stream presented to the graphics engine 10 
is separated into multiple strips of eight lines. A single WAIT instruction is inserted between the compositing 
commands for each strip. This forces the graphics engine 10 to wait for the scanner to "catch-up" before pro- 
ceeding to the next step. 

A software restart option allows the computer 1 to be synchronised to the graphics engine 10 instruction 
execution. If the computer 1 needs to perform certain tasks between groups of graphics instructions, a WAIT 
instruction can be inserted between the groups. When the graphics engine 10 reaches the WAIT, an interrupt 
is optionally generated, and the execution of the new instructions is suspended until the computer 1 writes to 
a CONTROL register with the RESTART bit set, as will be discussed later. 

The pipeline empty restart option allows the graphics engine 10 to wait-until the previous instruction is fully 
completed before starting a new instruction. This instruction is generally not required in normal operation. It is 
only required if a design error allowed neighbouring i instructions to effect each other. Table 6 shows the bit 
relationship of the WAIT instruction. 

In addition to registers provided for the DMA bus 14 in Fig. 2, registers are also available for the computer 
1 to interface with the graphics engine 10. Table 7 lists internal registers of the graphics engine 10 showing 
the relative address, size and whether the respective address is readable or writable. The COMMAND register 
provides an alternate mechanism for writing graphics commands to the graphics engine 1 0. Writing to the COM- 
MAND register places the input data on the top of a FIFO 150 within the RP interface 100, and is effectively 
the same as writing via the DMA bus 14. The status of the FIFO 150 should be read from the STATUS register 
before writing to the-COMMAND register, to ensure that the FIFO 150 is not full. Writing graphics commands 
via the COMMAND register is relatively slow, as it is not expected to be used in normal operation. It does, how- 
ever, provide a means of "single stepping" the graphics engine 10 for debugging and self test purposes. Th 
FIFO 150 can also be written one word at a time. The control unit 300, however does not read the FIFO 150 
until it contains at least four words. Table 8 shows the arrangement of the command register. 

The CONTROL register is used for setting interrupt , masks, clearing interrupts, and enabling various test 
modes, and includes a bit configuration as shown in Table 9. 

The HIE bit is the hardware interrupt enable. If set to a 1, the computer 1 will receive an interrupt when the 
graphics engine 10 receives a WAIT command when the "hardware restart" option is enabled. This interrupt 
is cleared by writing a 1 to the CHI bit (even if the graphics engine 10 is stopped). The setting of the HIE bit 
can be read from the STATUS register. 

The EIE bit is the "ERROR" interrupt enable. If set to a 1, the computer 1 will receive an interrupt when 
the graphics engine 10 detects an error condition. This interrupt is cleared by writing a 1 to the CEI bit, until a 
new error is detected. An error is detected if a hardware restart or a software restart is received when the 
graphics engine 10 is not in the STOPPED state. In the case of a hardware restart, this indicates a loss of 
synchronisation with the external real time event. The current setting of the EIE bit can be read from the STATUS 
register. 

The RST bit is used to restart the graphics pipeline. When this bit changes from a 0 to a 1, the graphics 
pipeline is restarted. 

The SWR bit is used to generate a software reset When set to a 1, the entire graphics engine 10 is reset 
to the power on reset condition. The SWR bit does not need to be cleared by a programmer, as the CONTROL 
register is also reset 

The SEE bit is the "stop on error" enable. If set to a 1, the graphics engine 10 will enter the STOPPED 
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condition if an error is detected. 

Six bits designated TMn are used to enable test modes in hinri^c lA/.th.r, u- 
used to p u , trie run ,e„ 9 th c01 ,„,er 220 , nl0 a test ™™ ^, J^™^?**^™ V' ™° " 

The STATUS register is used lo monitor the Interrupt flags and the FIFO 150 status in -I,* 
number, and ,0 read the status of the vaHous tes, module's. tL. 9 sho^The SZ^ 

sif civ S E E ' E "" S re " ect *" sttte of *» "" ern "" masks s « h »» CONTROL register The HIF 
- ^JTt^Sr •» — — P-*. interrupter o^se';: 

in thrcONTRO^eg'str ^ C ° nd " ton * ™ S inKm "" fe a< » red *» a 1 to the CEI bit 

tI" gr F m"£ S ? "k" 6 ".^ F ' F ? 150 IS inaiCa " n9 ,he COMMAND register should not be written 

t^o?^ 

The DOE bit is set when a DMA controller, with the RP contrmior ion a^J~*~ 

the version number of the integrated circuit (LSI) chip. 9 techno, °9y- The Vn b,ts c ™*™ 

The graphics engine 10 inputs and outputs can be classified into four categories- 

cel™^ 3E£ computer 1 - and in lhe preferred embodiment - the ~ " «» p- 

2. Page memory 4 interface signals; 

3. pan/zoom interface signals; and 

4. other interconnect signals. 

Table 10 provides a list of each of these signals 

re^rLT^f 1 ( ? 60) ~ n 3CCess the 9 ra P hics ^gine 10 in three ways, either by DMA writes reoister 

^^^^^^ 

end IX^X^Z^XI ^ ^ Sd * 6SS 3 ~ 

band™d,Ltf? p ^^Zfoorf S 10 a " d the """""""H page memory 4 preferably has e 

oerM (75 n o £ST ' i m *«* b l* M f erse «"«<-One32-bitpUelisreadandwrlttenineverypixelclock 
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ing the data at that address. This delay is compensated for by adjusting the position within the graphics pipeline 
of the engine 10 at which the data is accepted. 

Furthermore, in order to relax timing constraints on the graphics engine 10, a one clock delay is inserted 
between issuing a write, and when the write is actually performed. This delay is transparent to the operation 
of the graphics engine 10. 

The interface between the graphics engine 10 and the workscreen 3 uses the same address and data bus 
as used to read and write to the compositing memory 4. The workscreen memory 30 is shared between an S 
Bus device, display refresh, and reads and writes from the graphics engine 10 via the pan/zoom engine 9. As 
a result, there is a delay between issuing a read request, and reading the data at that address. Handshake 
signals from the pan/zoom engine 9 indicate when the data is ready. This delay is accounted for by stalling the 
pipeline of the graphics engine 10 until the read data is received. The workscreen memory 30 can also have 
a delay when writing pixel data. Typically, thefirst pixel written will not be delayed, as the write request is posted, 
and performed in subsequent clock periods. 

The preferred hardware implementation of the graphics engine 1 0 can now be described having introduced 
various control functions that the graphics engine 10 provides. Not all hardware components are described in 
detail as some perform functions that are readily identifiable from their name and hence are readily understood 
by those skilled in the art. 

Unless otherwise stated, all latches and flipflops to be referred to use the main pixel clock signal PXCLK 
(operating at 13.35 megahertz). Enable signals do not clock these devices directly, or gate the clock, but simply 
select new data via a front end multiplexer which would otherwise recirculate old data. The various modules 
of Fig. 2 can now be described. 

The render processor interface 100 of Fig. 3 is responsible for controlling and monitoring the status of the 
graphics pipeline of the engine 1 0 from the computer 1 which is preferably an i960 processor. All control signals 
between the graphics engine 10 and the computer 1 are synchronous to the computer clock HCLK. The RP 
interface 100 resynchronises the signals to the pixel clock (PXCLK) which is used by the remainder of th 
graphics engine 10. The RP interface 100 is divided into a control block 120 and a first-in-first-out shift register 
(FIFO) 150. 

The RP controller 120 is not shown in detail but generally includes an address decoder, a control regist r, 
a stat^ register,, an interrupt, controller.and.a^QMAHCOntrollerv The address-decodepgenerates«registerselect 
signals for the control register, the status register and the command register. The control register, when select d 
by the address decoder is loaded with data from the computer 1 which is used to configure the interrupt, 
synchronisation, and test modes as earlier defined. The control register contains resynchronisation flip-flops 
to synchronise to the pixel clock PXCLK. The status register allows the computer 1 to monitor interrupt flags, 
the status of the FIFO 150, and other internal conditions as defined earlier. The status register also contains 
circuitry to regenerate synchronised reset signals and status outputs. The interrupt controller combines interrupt 
flags from the status register where the interrupt enables from the control register to produce the output INT of 
i the interface 100. The DMA controller generates a DMA request when ROOM_44, seen in Fig. 3, indicates 
that the FIFO 1 50 has room for four more words from the computer 1 . When the computer 1 grants a DMA cycle 
via the DACK signal, a new word is written into the FIFO 150 in each processor clock period where NEXT is 
asserted. If a DMA acknowledge is received when a DMA cycle has not been requested, an error condition is 
generated. A WRITEFF signal is asserted for each valid DMA word. 

The FIFO 1 50 as seen in Fig. 4 uses an eight word 32 bit dual port RAM 170 to interface the computer 1 
with the pixel pipeline within the engine 10. A write address generator 155, synchronised to the processer clock 
PLCK interfaces the computer 1 to the RAM 170. A read address generator 160, synchronised to the pixel clock 
PXCLK interfaces the RAM 170 to the pixel pipeline. Handshake signals between the two address generators 
are synchronised in each of their respective receiving circuits. 

The write address generator 155 operates synchronously with the i processer clock PCLK to produce an 
address and a write pulse to the RAM 170. The generator 155 also generates a signal to indicate when there 
is room for four more words, and two signals to indicate when the two halves of the FIFO 150 contain valid 
data. The FFMT output provides an indication to the computer 1 via the status register of the RP Controller 
120 that the COMMAND register should not be written. 

The generator 155 is implemented with a three-bit counter which is reset to 0 when the RP_RESET signal 
is asserted. The counter increments when the WRITEOFF signal is asserted. 

The read address generator 160 operates synchronously with the pixel clock PXCLK to produce an address 
to the RAM 170. The generator 160 provides a signal to indicate when the FIFO 150 is empty and two signals 
to indicate when the two halves of the FIFO 150 have been read. The generator 160 is implemented with a 
four-bit state machine which is initialized when the PX_RESET signal is asserted. 

Fig. 5 shows a state transition diagram which Dlustrates how the address and the "empty" signal (MT) are 
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generated within the read address generator 160. 

The control unit 300, seen in Fig. 6. interfaces the render processer FIFO 1 *n »„ th» 
unit 300 consists of instruction and microcode registers 305 and 310 an £Ji *** * P ' Pe "' ne - The 

instruction decoder 350. The instruction interpre^fSo na^L in '? Stn f' °" '^rpreter 320. and an 

> the registers 305 and 31 0 in accordance wiSTh!?^ . ,nCOm,n 9 'nstrucbons and parameters, loading 
instruction register 310 to ^prXrfs^usTord XhT" '^T^ ^ inStnJCti ° n deCoder 350 us « 

This code is used for test and debug purposes v P asse <> &y tne data in the m,cro-code register 305. 

ment of each new microcontroller a new or mori on f„c! ? m,crocontroller s and processors.ln the develop- 
data must involve the interp^n^^^^ * «** """ 

^4^ 

polation ramp, and to indicate when the^asToiS ! fn Jf « k the deration of an inter- 

is required that the run oorJ^^^J^^ ESTSJiSJ 0 ""ST?" fUnCti ° nS ' '< 
indicator. The run length is loaded from the FIFO 5f \ LZh1J1 h P ' 3 Va '' d ,ndicator - and a stall 

320 of the control unit 300 The start anVvali^on^ , " ,nstructed bv the instruction interpreter 

staMsigna, comes from the wme con^^^^ 

register 210, a run length counter 220. -nd^Jn^^o contro,,e '200 includes a run length 

The run length register210. shown in more detail in Fig. 8. includes latches 21? fln n ^ 
216. The latch 212 contains the run length of the next araoh « rnmm.nH tk 3nd 3 mult iP'exer 

still be running when the latch 212 is loaded The laS 2U J' * CUrrent 9r3phiCS comma ^ can 

an USE_ALT signal is asserted ^'^ an a ' temale mn ,en 9 th whicn is when 

posited and ind cates ^toi^S^JT counter 220 counts the number of pixel corn- 
value in the run length regfeter 210 Zt the START .ta! COUnter rt 22 °i s ' ni,ia '^d with one ,ess than the 
cycles, including the startle, less STaIl ^ h a " 

retains the current value. When the counter reaches 0 the , 9 ' Wh,Ch 0386 the COunter 

the run length is 1 In this case the LMT sil r f- 9 " a ' ' S asserted - Tn e minimum value for 

The ramp V'^^'^^^^T 1 1" * V."" "** 83 lhe START ^ is 
rea, time duration of the * ™ -er the 

by - STA^^f Tn ^^ a -Piexer switch 254 selected 

individual units which operate on selective oTts o fZ J!J ^ T?' represenl,n 9 the ™n '^gth. to four 
ROM 260 which comprises a l^p to^ o^l? n T^" ^ S '' 9nifiC3nt b ' tS 3re SUpp,ied to a 

word are supplied to a suSi^^SS V* "? COITeSpondin 9 to 255/(RL-1). All 13 bits of the 
supply a further mi^S^S^^jE^S'S 2^' ' RL \ S ^' a » 13 bi * 

mode detector circuit 275 which represent fiveln .fnp f " 1 ^ * m ° St si 9 nificant "ts supply a 

ROM 260, CONIST1 265 and CONST2 27n ™I, 1? ^ 6 Pr ° V '' deS 3 jump signal - Tne oul P ute °' 

to the flipflop 294. ' ba ° k 10 SW,tch the mull| P'exer 282 as well as supplying an input 

updatfof^^ 

when the run length is greater than 255) T^ni 7 o if r3mP ° UtPUt Va,Ue to Ch3nged (as in the ««» 
the run lenath hi— L 7 5 , _ t0r . 0 °P erates two modes. Firstly a 

"jump" mode when 

mode is determined by an examina ion 2?„ r 3 P m0de Whe " thS m " ' 6n9tn is ««an 255. The 

ramp value wi„ ^^^S^^^S ^'T ?* ° f ^ ' en9th " ,n 1,16 jump mode ' tne 
a maximum of one in each c^ck pirtod * Pen ° d - ,he St6P m ° de ' the ramp vaSue increases b V 
When in the jump mode, the output value from the ROM 260 is added to the accumulator 292 which is 
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initialized to zero by the START signal. The most significant bits of the accumulator form the ramp output as 
can be seen from Fig. 10. 

When in the step mode, the generator 250 applies an algorithm similar to Bresenham's line drawing 
algorithm, which is known to those in the computer graphic arts. The accumulator 292 is initialized with the con- 
stant (511 - RL), and the ramp counter is initialized to zero. In subsequent clocks, one of two constants are 
added to the accumulator:- 510 if the accumulator is negative, or, (512 - 2RL) if the accumulator is positive. 
The ramp counter (flip flop 294) is incremented on every clock where the accumulator 292 is positive. Note 
that thestep mode requires the multiplexer 254 to bypass the latch 252 during the START cycle. This allows 
the new value in the CONSTI module 265 to be loaded into the accumulator 292 in the START cycle. The mul- 
tiplexers 280 and 282 are shown in Fig. 10 for clarity only. A simpler implementation for the multiplexer 280 is 
a string of AND gates. Similarly, the multiplexer 282 can be implemented as a string of OR gate for the nine 
least significant bits, and a string of AND gates for the most significant bits. 

The colour component data path of the graphic engine 1 0 consists of three eight-bit data paths. Each eight- 
bit path contains an interpolator 400, which also performs colour correction, and a compositor 500. The red, 
green and blue interpolators are identical. The three compositors, however, are slightly different. The green 
compositor is used by the matte channel in some instructions and the resulting differences in the green channel 
are described later in this specification. The colour interpolators 400 comprise three modules: a synchronisation 
module 41 0, an interpolation phase A 420, and an interpolation phase B 450. The synchronisation module 410 
spans stages 0 and 1 of the graphics pipeline and holds operands for the current instruction, and instruction 
operands for the next instruction. The interpolation phases A and B (420 and 450) perform blends and colour 
correction. The two phases are required in order to maintain one pixel per clock cycle. Phase A (420) is in stage 
2 of the pipeline, and phase B (450) is in stage 3. 

The synchronisation module 410 is not shown in detail but however comprises latches and flipflops which 
provide the start colour, the end colour and the pixel value. 

The interpolation phase A 420 is shown in Fig. 13 and performs two types of calculations, either blends, 
or colour corrections. Those skilled in the art will appreciate that colour runs are simply a degenerate case of 
a blend when the start and end colours are the same. The phase A interpolator 420 performs calculations for 
the first part of both types, namely 

for blends, C_MULT = (E ND. START)* RAM R/256 * - 
for colour correction, C_MULT = (END*PIXEL/256). 
The interpolator phase A420 comprises a summer 4Z2 which subtracts the START value from the END 
value. The multiplexers 424 and 426 change the inputs to the multiplier 430 respectively. For blend commands, 
the summer 422 has a nine-bit two's complement output which is fed to the multiplier 430. For colour correction 
commands, the END value (which also corresponds to the CONTRAST) is fed to the multiplier 430 when the 
sign bit is set positive. In blend commands, the second input of the multiplier 430 comes from the ramp generator 
250 of Fig. 10 and has the effect of controlling the mixing proportion. In colour correction commands, the second 
input of the multiplier 430 comes from the commands pixel string. A new pixel to be colour corrected is loaded 
every non-stalled dock period. Only the top nine bits and the sign of the multiplier output are required. The 
output is a 10-bit two's complement number. Rounding is not performed as a one least significant bit error is 
considered acceptable. 

The STARTED output contains a delayed version of the START value for the next phase of the interpolator. 
In colour correction mode, the START value is equivalent to the BRIGHTNESS. 

Phase B 450 of the interpolator 400 performs the second part of both of the blends, and colour correction 
calculations, namely 

for blends, D__OUT = C_MULT + STARTED = ((EN D-START) * RA M P/256) + START 
for colour correction, D_OUT = (C_MULT*2) + START_D = ((EN D_START)* RAMP/1 28) + START 
It should be noted that rounding errors introduced by the multiplier 430 of phase A 420 lead to an error of 
one in the final value of the- blend. That is 

for END-START>0. D_OUT( FINAL) = END-1 
for END-START>0, D_OUT(FINAL) = END + 1. 
The phase B circuit 450 comprises two multiplexers 452 and 454 both supplying a summer 458. The sum- 
mer 458 supplies a limiter 460 which, in turn, supplies the output via a latch 462. The multiplexer 452 changes 
the inputs to the adder 458 to perform the multiplication by 2 for the colour correction function. The most sig- 
nificant bits pass straight through to the adder 458. For blend commands, this has the effect of signextending 
bit 8 by one position. 

The multiplexer 454 adds an extra two inputs to the adder 458 to perform a sign-extension for the colour 
correction function, which uses a 8-bit signed format for the BRIGHTNESS. For blend commands, these two 
bits are set to 0, as the START value is unsigned. The adder 458 sums the two 10-bit signed numbers output 
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from he multiplexes 452 and 454 to produce a 10-bit signed result, and a positive overflow indicator fPOV> a 
ne 9 at,ve overflow » not possible in either blend or colour correction modes, .n blench \7Zei^Z^e t 
always positive, and m colour correction mode, the MULT value is always positive 

The adder 458 output is passed to the limiter 460 which is used to saturate values greater than 255 to 25s 

ourIourcI £2 VT This fan ?°JLT be dfeab,ed by an EFFECTS si9nal ^nSS^^SS 

uuk SOURCE field in the command. The m ter 460 has no effort in n.n rt r ^ . 

clock penod, in readiness for the next pipeline stage. ^ stalled 

The limiter 460 is not shown in detail as its configuration can be readilv derive f mm 
The limiter 460 has an input bus range from -128 to %638. * ^ abOVB desc "P t,on - 

Fig. 1 5 shows a colour compositer 500 which includes four modules comprising a synchronisation mnW.,. e 
520. a compositing phase A 540. a compositing phase B 560, and a matte multiplexer 580 Th^h™ ^ 
module 520 is in stage 4 of the graphics pipe.ine. .t latches the RGBf^ data rcm th TexZlZToSnaZ 

W0 . rk «T n J nem ° ry <28 ° r 30) 3nd s >" chron ^ 't with the pixel pipeline. Compositfng phases TZ B So 
and 560 perform a blend between the original value in the externa, memory, and the neJ vafue fr ! LI' 

co^r 9 t po t 4o °- The proportion between o^i9ina, and new - «Swi- byi: ssizfzsz 

comes from the matte combiner 700. Two phases are aaain r»nuir*H in nrw., ♦ V ' cn 

tplexer 580 allows th. data from the made channel to be output on one ot more of Z ooteL'r „1„T, ^ 

- o^r^r;^^ 

Phase A 540 of the compositor 500. seen in Fig. 16. performs the following calculation- 

C_MULT = (C_ORIG - C_NEW)»BLEND/256 
Phase A 540 includes a summer 542. a multiplier 546 and output latches 548 and «5«5f> an r. nmm ^ 
the same format with 8-bit unsigned, values on each of the NEW OR.G and BLEND fn Duf f Th f ?^ 86 
30 the adder 542, which, having an inverted input functions as a subtrac ter' Z 1 ? w/S , * 

which is ted to the multiplier 546. The second input to"^ 'J^StS ^^1^2^^ 

Z to^a J, T> COntr ° ,,in9 mixi "9 P r °P° rti -- A value of 0 represents a fully o£j?S£X o^E 
the top 8 bits and the s.gn of the multiplier 456 output are required. The output is a 9 bit , 

3 5 rw^o^rz;; 393 ' 0 ^ 1 pe ri rformed as a one ,east si9nmcant « ^^ssL^arK 

^S^S^:^ delayed version of the NEW vaiue for the next phase (560) of the compositor 500 
Phase B 560 ^ compos, tor 50 i m shown in Fig. 17 and performs the fo.lowing calculation: 
m f fi* * D - MULT = C - MUL T + C_NEW_D =((ORIG-NEW).BLEND/256) + C NEW 
for „ !!! 9 e ; rore j ntroduce d by the multiplier 546.in phase A 540 lead to a"n error of at least one 

for non-opaque compositing. For example: 
40 when ORIG-NEW>0. 

D_OUT (fully transparent) = ORIG-1 , 
when ORIG-NEW<0. 
D_OUT (fully transparent) = ORIG+1. 
Fully opaque compositing will always produce the exact NEW value That is- 
45 D_OUT(fully opaque)=NEW. 

A wirh^S^ifun^^ 8 T "I' 6 ' T JS* SUmS 016 9 - bit Si9ned number from the m «'«P«er S46 in phase 
A w.th the 8-b.t unsigned value from the ORIG delay flip-flop 550 also in phase A 540 The result is alwavs in 
the range 0 to 255 as the compositor 500 can only produce numbers between the ORIG and NEW v^s The 

so stg^r^ 

700 ^TZTn!^ I 31 ' Pa l h S6en ! ig - 2 com P rises * transparency interpolator 600 and a matte combiner 
Lee i s s?ART and ^ 1m f KK ' ,0 9 enerate a ^nsparency blend by interpolates be- 
tween a START and END transparency, and to combine the transparency blend with the matte plane 

ss mod e^oTn Z^^Te^T « ^ ? ^ »~ ^ ~nisa«on 
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stage 3. 

Fig. 19 shows the various components within the synchronisation module 610. Latches 612 and 614 hold 
the START and END transparency parameters for the next instruction. This corresponds to stage 0 of the 
pipeline. Flip flops-620 and 622 load these values when the START signal indicates the first pixel of a new 
instruction is in progress. A third latch-616 holds the texture start parameter for the next instruction. This is a 
2-bit parameter which determines the byte offset into the initial texture word for texture run and blend com- 
mands. It is loaded into a third flip-flop 624 via a next offset module 61 8 when the START signal indicates the 
first pixel of an new instruction is in progress. A fourth flip-flop 626 holds the texture for up to four pixels in texture 
run and blend commands. The flip-flop 626 is loaded from the data bus DJN in every fourth non-stalled clock 
period. A fifth flip-flop 628 holds the transparency for each pixel value in pixel string commands, and is loaded 
from the data bus in every non-stalled clock period. 

The next offset module 618 is used to control the texture byte offset, and the texture word flip-flop 626. For 
texture run and blend commands, the texture byte offset is Initialised when the START signal is asserted, and 
counts by one in other non-stalled clock periods. When the offset equals 3, the LASTT signal is asserted! The 
texture word flip-flop 626 enable signal is also asserted at this time as well as during the start cycle 
Phase A 640 of the interpolator 600 is shown in Fig. 20 and performs the following calculation: 

TJvlULT = (T_END-T_START) * RAM P/256. 
Phase A 640 includes a summer 642 configured to subtract two input data bytes. The output of the summer 
642 is a 9-bit two's complement output which is fed to a multiplier 644. The second input of the multiplier 644 
comes from the ramp generator 150 of Fig. 10 and has the effect of controlling the mixing proportion between 
the start and end values. Only the top eight bits and the sign of the multiplier 644 output are required. The output 
is a 9-bit two's complement number. Again, rounding is not performed as one least significant bit error is con- 
sidered acceptable. Phase A 640 also has a T_START_D output which contains a delayed version of the 
T_START value provided by a latch 652 for the next phase of the interpolator 600. A multiplexer 646 uses the 
T_OFF signal from the synchroniser 61 0 to select one of the four texture bytes in the T_PIXELS input. In texture 
run and blend commands, a different byte is selected in each pixel clock PXCLK period. In pixel string com- 
mands, the transparency is selected from the T_PIXELS input via a second multiplexer 648. The texture is 
latched (654) to maintain synchronisation with pipeline stage 2. 

Phase B 670 of the interpolator 600 is shown in fig. 21 and performs the second part pf blejid calculations, 
namely: 

TR_OUT = TJvlULT + T_START_D = ((T_END-T_START)* RAMP/256 + T-START. 
Note that rounding errors introduced by the multiplier 644 in phase A 640 lead to an error of one in the final 
value of a blend. That is: 

for T_END-T_START>0,TR_OUT(FINAL) = T-END-1 
for T_E N D -T_S TA RT< O , TR_0 U T( Fl N A L) = T_END + I 
In Fig. 21, an adder 672 sums the 9-bit signed 

value from the multiplier 644 in Fig. 20 with the 8-bit start value. The result is always a positive 8-bit value 
since it must be in the range T_START ... T.END. A multiplexer 676 selects the output of the adder 672 for 
run or blend commands, or the texture byte for other commands. In bitmap commands, and commands which 
do not have subsequent words, the opaque signal is asserted. This has the effect of forcing the texture to 25S 
(opaque). 

The output from the multiplexer 676 is latched (678) in every non-stalled clock period, in readiness for the 
next pipeline stage. The transparency value from the output of the "opaqueing" module is also latched (680) 
and passed separately to the matte combiner 700. The matte combiner 700 uses this value when the "matte 
source = subsequent" option is specified by the instruction set. 

The matte combiner 700 is shown in Fig. 22 having four modules including a synchronisation module 710. 
a matte calculator 730. a matte delay 720, and a green multiplexer 790. 

The synchronisation module 710 is in stage 4 of the pipeline. It latches the matte plane of the RGBM data 
from the external compositing 4 or workscreen 30 memory in the first half of the pixel clock period. Based on 
this value and the transparency from the transparency interpolator 600, the matte calculator 730 produces a 
blend value for the colour compositors 500, and a new matte value for the compositing memory 28. Matte cal- 
culations are performed in stage 4 of the pipeline. The delay 720 passes the matte value unchanged to the 
green multiplexer 790 to synchronise with the colour components which are calculated in stages 5 and 6. The 
green multiplexer 790 allows the data from the green channel (400, 500: Fig. 2) to be output on the matte plane 
of the compositing memory 4. This is performed in stage 7 of the pipeline. 

Fig. 23 shows the components within the matte synchronisation module 710. A latch 712 produces a 
delayed version of the STALL signal to compensate for the one clock delay in the compositing memory 28. The 
input data from the external memory is asynchronous to the main internal pixel clock PXCLK. It is, however, 
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synchronous to the CLK 2 signal which is used to derive the pixel clock via a NAND gate 714 so the Dhase h 
fixed. Th.s data is latched (716) in the first half of the pipeline stage 4. 

The matte calculator 730 is seen in Fig. 25 and performs two calculations: 

BLEND = LIMIT (TR_IN-M_SYNCD) 
M_OUT = LIMIT (M_SYNCD-TR IN) 
where LIMIT (a-b) = 0, if a - b < 0 ' ~ 

Both the TRJN and the M_SYNCD inputs are 8-bit unsigned values. The components in the too half of 

Si C f C t te K t 1 he 1 BL l ND ° UtPUt Wh ' Ch iS US6d t0 ^ the mixin 9 ° ro P° rti ° n in ^ colourcomposi o2 
500. input to h,s block with a value of 0 represents fully transparent compositing. An output value of 0 howevT 

represents fully opaque compositing. The BLEND value is calculated by substracting, using an adder 734 the 

matte data ,n the composing (or workscreen) memory 28 from the transparency which has either a run blend 

the ider 7^ A "Tf VST*?* ^ l ° 0 " the reSU,t * ne9ative « as indicated b * *• car^ bit from 
the adder 734 A mult.plexer 742 can bypass this calculation and select the input transparency if *e T M- 

Z ^7 " "° TTH he T - M,NUS - M Sl '9" al «■ ^coded from the command word in the confrofun t 
i h compensate for the different input and output sense of the transparency, the multiplexer 742 is nonTially 
mverted by an EXOR block 750, or is passed straight through to the output via a .atch 752 when theT.NV 

siQricM is 3ss@rt@o. — 

The components in the bottom half of Fig. 25 calculate the M_OUT output which is written into the matte 

clmnoSnYf'" 6 * U8,n » m a * tar ™> ^nsparency from the ma«e d^ta 7tE 

composing (or workscreen) memory 4,30. A further limiter 740 saturates this value to 0 if the result is negative 
as mdicated by a carry from the adder 736. Asubsequent mu.tiplexer746can bypass this calculation and se leci 
the .nput transparency ,f the M_M.NUS_T input is not asserted. A further multiplexer 748 selects the subs T 
value if the matte source field of the instruction specifies "subsequent". Both the BLEND M OUT VALUES a7e 
latched to synchronise to stage 4 of the pipeline; ~ 

^nl^ matted , e !! y 72 °' S6en 24< PaSSeS the matte value ""^hanged to the green multiplexer 790 to 

5 and TZ " h fl fl C °T; entS r hiCh afe Ca,CU ' ated in sta 9 e 5 and 6 - A fli P- fl °P 722 synchronises to stage 
5, and a second flip-flop-754 synchronises to stage 6. 

The address generator block 800 of Fig. 2 is responsible for providing read and write addresses for the 
composing memory 4 and the workscreen memory 30. A read cycle and a write cycle are performed in each 
pixel clock penod. Compositing memory 4 addresses are provided by a page - address generator within the 
address generator 800. and workscreen addresses are provided by a screen address generator also within he 
generator 800^ The address generator 800 operates in a manner known to those skil.ed in the art in selecting 
appropnate addresses with.n the workscreen and compositing memories. Having se.ected the appropriate 
address e,ther the page or screen addresses are multiplexed as determined by read and write signals The 

I r l e „ a , H 6 f PaSSGd thr ° U9h 3 fiVe Sta9e d6lay l ° com P ens ^ ^ delays through the colour and trans- 
parency data paths. 

The write controller 900 of Fig. 2 performs three main tasks, namely: 

1. To implement additional functionality required for "bitmap mode" instructions- 

2. To provide appropriate pipeline delays of write control and other status information- and 
40 3. To interface to the compositing memory 28 and the pan/zoom engine 9 handshake signals 

*nH '!, , 6 ! f S 9 enerator 80 °- 1"* write controller 900 forms functions known to those skilled in the art 
and a deta.led explanation is not required in the specification. Essentially, the write controller 900 includes three 
stages compns.ng a b,t map shift, a status delay, and an I/O control unit, .n bit map mode instructions, a single 

tc JeZ l P Jn T B ^r 0 "™ and other status information. It also contains a block which is used 

™nr! f o I T! ^ ,S 6mPty - The 110 COntr ° ,,er interfaces to the compositing memory 4 and the pan- 
/zoom engine 9 handshake signals. 

^hJI 1 !^ 90 !! 19 ■ d , eSCriP K 1 0 1 PrOV j deS a " architecture fa y which a graphics engine 10 can be produced to 
achieve colour desktop publishing functions. 

The preferred embodiment of the graphics engine is able to implement a wide range of commands so as 

VaTpIXELS CANVAS 3 TfSurp^ Z™"**™?**™^™™ as CANVAS RUN, CANVAS BLEND, CAN- 
tk J^S^S T EXTURE RUN and CANVAS TEXTURE BLEND will now be described. 
The CANVAS RUN function composites a run of colours with the image, with compositing controlled by 

textu7ed S m a a S ^ "T* T' I"" oomn » nd Simu,ates the interaction * P»" «*h -nvas o other 
SStSSS'.tl . 1 m K the back 9 round Progressively "fill up" with "paint". The canvas run com- 

r ■! "*? 9 " Umber ° f Pixe,s 0f a co,our e * ual to the start «>'«». starting at the page 

address The mtens.ty of composing is controlled by the difference between the transparency run and the 
matte plane of the image. The matte plane is changed to. be the difference between the matte value and the 
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run transparency. 

The operation is as follows for n=0 to Run_length-1 

RGB (image) [address+n] 
= ( (T (start) .Matte (image) [address+n] ) *rgb (start) 
+ (256- (T (start) .Matte (image) [address+n) ) ) 
*RGB (image) [address+n] ) /256 

Matte (image) [address+n] 
= Matte (image) [address+n ]. T (start ) 

where . represents a subtraction operation with underflow limiting. 

The canvas run command is used for compositing object based constant colour "brush strokes" onto a text- 
ured background, where it is desirable to simulate the cumulative interaction between paint and a textured back- 
ground. To achieve this effect, the page matte should contain an image which corresponds to the texture of 
the background. 

The CANVAS BLEND function composites a blend of colours with the image, with compositing controll d 
by the blend transparency and the matte plane. This command simulates the interaction of paint with canvas 
or other textured materials, as textured dents in the background progressively "fill up" with "paint". 

The canvas blend command composites the run length number of pixels of a colour smoothly changing 
from the start colour to the end colour with the image, starting at the page address. The intensity of compositing 
is controlled by the difference between the blended transparency and the matte plane of the image. The matte 
plane is changed to be the difference between the matte value and the blend transparency. 

The operation is as follows for n-O to Run/length-1 
Interpolation [n] = n/ (Run_length-1) 
T[n] = Interpolation [n] *T (end) 

+ d-lnterpolation[n] ) *T(start) 
RGB [n] = Interpolation [n ] *RGB (end) 

+ (1-interpolation [n] ) *RGB (start) 
RGB (image) [address+n] 
= ( (T[n) .Matte (image) [address+n] ) *RGB [n] 
+ (256- <T[n] .Matte (image) [address+n])) 

*RGB (image) [address+n] ) /256 

■t 

Matte (image) [address+n] 
= Matte (image) [address+n] .T[n] 

where . represents a subtraction operation with underflow limiting. 

The canvas blend command is used for compositing object based blended colour "brush strokes'' onto a 
textured background, where it is desirable to simulate the cumulative interaction between paint and a textured 
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background. To achieve this effect, the page matte should contain an image which corresponds to the texture 
of the background. c 

The CANVAS PIXELS function composites a run of pixels with the image, with compositing controlled by 
the prxel transparency and the matte plane. This command simulates the interaction of paint with canvas or 
other textured materials, as textured dents in the background progressively "fill up" with "paint" 

The canvas pixels command composites the run length number of pixels with the image, starting at the 
page address. The intensity of compositing is controlled by the difference between the pixel transparency and 
the matte plane of the image. The matte plane is changed to be the difference between the matte value and 
the pixel transparency. 

The operation is as follows tor n=o to Run_l eng th-1 

RGB (colour corrected) [n] 

•# 

- Limit ( (RGB (contrast ) *RGB (pixel) (n])/256+ RGB(bright)) 

RGB (image) [address+n] 
= ( (T (pixel) .Matte (image) [address+n J ) 
*RGB (colour corrected) [n] 

+ (256- (T (pixel) .Matte (image) (address+n])) 
* RGB (image) [address+n] ) /256 
Matte (image) (address+n] 
= Matte(image) (address=n J .T (pixel) 

where . represents a subtraction operation with underflow limiting. 

The canvas pixels command is used for compositing pixel based 'brush strokes" onto a textured back 
ground, where it is desirable to simulate cumulative interaction between paint and a textured background To 
achieve th.s effect, the page matte should contain an image which corresponds to the texture of the background 
Th.s command can also be used for simulating screen printing of a natural image onto a textured background* 
„ r. C * NVAS TEXTURE RUN function composites a run of colours with the image, with compositing con - 
troUed by the texture and the matte plane. This command simulates the interaction of textured brush strokes 
wrth canvas or other textured materials, as textured dents in the background progressively "fill up " with "paint" 

The canvas am command composites the run length number of pixels of a colour equal to the start colour' 
starting at the page address. The intensity of compositing is controlled by the difference between the i subse- 
quent texture bytes and the matte plane of the image i The matte plane is changed to be the difference between 
the matte value and the texture bytes. 



The operation is as follows for n,G to Run_length-1 
RGB (image) [address+n J 
^ - ( (Texture [n] , Matte (image) [address+n] > *RGB (start) 
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+ (256- (Texture [n] .Matte (image) [address+n] ) ) 
*RGB (image) [address+n] ) /256 
5 Matte (image) [address+n] 

= Matte (image) [address+n] .Texture [n] 

10 where . represents a subtraction operation with underflow limiting. 

The canvas texture run command is used for compositing textured object based constant colour "brush 
strokes" onto a textured background, where it is desirable to simulate the cumulative interaction between paint 
and a textured background. To achieve this effect, the page matte should contain an image which corresponds 
to the texture of the background, and the subsequent texture bytes should contain a texture which relates to 

f5 the texture of the paint medium. 

The CANVAS TEXTURE BLEND function composites a blend of colours with the image, with compositing 
controlled by the texture and the matte plane. This command simulates the interaction of textured brush strokes 
with canvas or other textured materials, as textured dents in the background progressively "fill up" with "paint" 
The canvas blend command composites the run length number of pixels of a colour smoothly chanqinq 
from the start colour to the end colour with the image. The intensity of compositing is controlled by the difference 
between the subsequent texture bytes and the matte plane of the image. The transparency values of the start 
and. end colour are ignored. The matte plane is changed to be the difference between the matte value and the 
texture bytes. 
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The operation is as follows for n-O to 
Blend_length-1 

RGB [n ] - Interpolation [n ] *RGB (end) 
+ d-Interpolation[n] ) *RGB (start ) 

35 

RGB (image) [address+n] 
- ( (Texture [n] .Matte (image) [address+n] ) *RGB [n] 
„ + (256- (Texture [n]. Matte (image) [address+n])) 

*RGB (image) [address+n] ) /256 
Matte (image) [address+n] 
45 = Matte(image) [address+n] . Texture [n] 

where . represents a subtraction operation with underflow limiting. 

The canvas texture blend command is used for compositing textured object based constant colour "Brush 
so strokes onto a textured background, where it is desirable to simulate the cumulative interaction between paint 
and a textured background. To achieve this effect, the page matte should contain an image which corresponds 
to the texture of the background, and the subsequent texture bytes should contain a texture which relates to 
the texture of the paint medium. 

It will be apparent to those skilled in the art that the above described functions result in the matte plane 
55 being changed by either the colour information (transparency) of the image or the texture information of the 
image. Subsequent manipulations are based on the changed matte plane. 

The foregoing describes only one embodiment of the present invention, and modifications, obvious to those 
skilled in the art can be made thereto without the party from the scope of the present invention. 
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synchronisation throughout the various pipeline stages throuoh Itaotlh PreSe "' emb °<"' m o™ '° enable 
capaW. of operating a. much higher speeds theTcUssino rtT.! 9 "IT* 1 °' Wilh «* « 

become insignifican, and the reoSrernen, for ^£^7^7^^ itemS - 
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0000 
0001 
0010 
0011 



0100 
0101 
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01H 



Compositing 
Synchrony sation 
Synchronisation 
Configuration 
Configuration 



Configuration 



Compos i ting 



Instnif 

COMPOSITE 

NOP 

WAIT 

LOAD ZOOM 

LOAD CONFIGURATION 



LOAD REGISTER 

spare 

spare 

EXECUTE UCO0E 



. fields ParamPt-pr^ 

Multiple Multiple 

HW/SW/PL 

Zoom ratios 
Skip mode, write 
enables, mux 
enables 
Register # Register value 
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TABLE 3 



* 15 M 13 12 " '0 9 8 7 6 <- , , 
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TABLE 4 
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Function O i j 7 



27 26 25 24 23 22 



21 20 19 18 17 16 
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Function 



15 14 13 12 11 10 9 
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TABLE 5 



Reg # NAME 



DESCRIPTION 



0 STORED Start value of red component for run or -blend. Also used 

as red Brightness. 

1 ST_GREEN Start value of green component for run or blend. Also 

used as green Brightness. 

2 ST_BLUE Start value of blue component for run or blend. Also 

used as blue Brightness. 

3 ST_TRANS Start value of transparency component for run or blend. 

4 ND_RE0 End value of red component for run or blend. Also used 

as red Contrast. 

5 ND_GREEN End value of green component for run or blend. Also used 

as green Contrast. 

6 ND_8LUE End value of blue component for run or blend. Also used 

as blue Contrast. 

7 ND_TRANS End value of transparency component for run or blend. 

8 P_START__H Horizontal Page Start address. 

9 P_START_V Vertical Page Start address. 

10 S_START_H Horizontal Screen Start address. 

11 S_START_V Vertical Screen Start address. 

12 UCODE Microcode register. 

13 ALT_RL Alternate run length register. 

14 T_0FP texture offset register. 



TABLE 6 



Bit ft 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 
Function 0 0 0 1 HWR SHR PI R 



TABLE 7 



ADDRESS 


MODE 


NAME 


SIZE 


DESCRIPTION 


00 


WRITE 


COMMAN0 


32 


Command input 


01 


WRITE 


CONTROL 


16 


general control 




READ 


STATUS 


1$ 


general status 
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TABLF ft 

S1t ff 31 30 - 25 24 23 22 .. 17 16 15 14 9 8 7 6 

Instruction COMMA NO t.O. R. l 



Operands 7 R 0 7 T. Q_ 



J. B 0 7 t • n 



TABLE 9 



Bit * '5 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

Function HIP STF E IE CHI CSI CFT RST <:u r SEE OF TMS TM4 TM3 TM? TMl Tj an 



TABLE IP 



Bit * '5 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

Function HTF SIE EIE HI F SIF FT F FFF r. P OOF T<^ TV T S1 TSO V? XM un 
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TABLE 11 



Type 


Name 


tfjnputS 


^outputs 




Description 


1960 


HCLK 


1 






1960 Processor clock 33MHz 




RESET- 


1 






Power on reset 




HD<31..0> 






32 


Host data bus 




CE- 


1 






cHId *nable 




RS 


1 






regl ster select 




READ- 


1 






read enable 




DACK- 


1 






0MA acknowledge 




NEXT- 


] 






npvf DMA wor~ff prtrihlp 




DREQ- 




] 




DMA rpniiP<jt" 




INT- 




1 




Interrupt request 


Comp . 


CLK2 


1 






twice pixel clock, 27MHz 


Mem- 


PXCLK 


1 






pixel clock, 13.5MHz 




CD_0E- 


1 






comno<; i t 1 no data hus ouirnut 












enab 1 e 




C0(31 0> 












LA0O2- .0) 




1 3 




minnow it*inn 1 i np arirf i"** ^ <i 




PAD<12. .0) 




13 




comno5 1 tina oixel address 




BUFWI- 


- 




- 


buffer write Image 




BUFWM- 








buffer write matte 




BUFR- 








buffer read 


pan/ 


GFX_ST- 








start of graphics run 


zoom 


VERT- 








screen verti cal run 




LD_ZOOM- 




1 




load zoom ratios 




SC^WE-O. .0) 








screen byte lane write enables 




SC_REA0 




} 




screen read 




SC_ WRITE 




1 




screen wri te 






] 






screen reao uata acKnowteoge 




SC_WDACK- 


1 






screen write data acknowledge 


other 


TESTIN 


1 






test mode Input 




TESTOUT 




1 




test mode output 




ERR- 




1 




error indicator 




GFX RUN- 




1 




graphics engine running 












Indicator 




RESTART- 


1 






restart command from VINEGAR 




TEST<2:0) 


3 






test inputs 




TE$T3 




1 




test outout 
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1- A graphics engine to manipulate colour image data on a pixel-bv-oixel ba^i* « aM m • 

parallel data path for each one of three primary colours of said Z^d^LS IT! C ° mprisin 9 a 
cascade connected interpolator and compositor and said date paths bio rnn ^ C ° mprisin 9 a 
cascade connected transparency interpolator and Z TrZ^T I 9 COnnected ,n P ara »el with a 
polators receive colour variation LrSZ^ e3Ch ° f Said co,our 

said pixel in turn and said composite^ far each 

mined func,on the output of which is dependent u^ 

ation command, and wherein said transtwpnrw inw™i^ . coiour aata and the interpolator van- 

a colour blend value and an output matte value for eac T^L ^JIi h . d comb ''»''l*oduces 

in accordance with s„d transparency vartaS T^^ZZZti^ZZ^JZ™"" l'* 
image data simultaneously manipulated ascolour. transparency m matte 9 6e '" 9 C °"" ,r 

' ^^^^SrS,'^," S ' M — «— ' — - — d ,rom t.e 

^ -r^ate^ree^rin^ 

colour variation commands. interpolated for a specific duratron determined by the 

A graphics engine as claimed in anv one of riaim« 1 t„o • 

are selected ,rom ,„e g rou P ^Z^^l^ZZ^^ 

£c«™^ 

determined >y th. transparency va^on^rnTanl " '« ■ duratton 

The graphics engine as claimed in anyone of claim* itnc «,h~~* * r 

by said data is provided with a uj^^^^*?^'"* ^ P3ft ° f the ima 9 e represented 
predetermined colour by said c^ZSor^^S? * " maWe C ° mb '' ner 3nd C °'° ured with a 
coloured, the matte plane o LiZaTp^li T Z.ZT ^ " Simu,taneo -'y «^«d and 
data of said image part. " " H-™ a cnangea ,n accordance with the colour data or texture 

to 6° IOUr ,ma9e data manipu,ati0n -'^ing a graphics engine as claimed in any one of claims 1 

A ramp generator comprising: 

means for storing a number (N) as a plurality of bits of information- 
mined:maxl C uTand r 96nerat '' n9 * determined minimum and a predeter- 

ofoper^TwS 

mode, said output increases at a raTe equal o oT^Zs7lT ^TJ^V^ ™ T J " Ump 
the rea. time taken for said output to traverse be^nSd ^^^ZSSiST 

is 0 % of the interpolation 

^SST^ 38 C ' aimed ^ daim 8 " 9 Wh6rein S3id -P-sents the run length of a graphics 

and Sd^r^ 8 * " ^ -id number includes at .east 8 bits 

mode if said number t. el than St ^ " ^ ' S «»* ' tha " 255 " said jump 



mode if said number is less than or equal to 255 
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12. A ramp generator as claimed in any one of claims 8 to 1 1 wherein said accumulator comprises a first mod- 
ule for calculating 255/(N-1), a second module for calculating (51 1-N) and a third module for calculating 
(512-2N), there being a first multiplexer connected to a zero vaue and an output of said second module, 
a second multiplexer connected to a constant (510) and the output of said third module and a third mul- 
tiplexer having inputs connected to the output of said first module and the output of said second multiplexer, 
and adder having one input supplied by the output of said third multiplexer and outputting to a fourth mul- 
tiplexer having its other input connected to the output of said first multiplexer, and an accumulating flip 
flop connected to the output of said fourth multiplexer whose output supplied an output of said ramp 
generator and a second input to said adder. 

13. A ramp generator as claimed in claim 12, and comprising a counter connected to the output of said 
accumulating flip-flop and having its count value incremented thereby, and a fifth multiplexer having one 
input connected to the count value output of said counter and the other input connected to said output of 
said fourth multiplexer, the output of said fifth multiplexer being switchable between said count value to 
provide a step mode ramp output and said fourth multiplexer output to provide an alternate, jump mode 
output. 

14. A graphics engine as claimed in claim 3 or claim 5 including the ramp generator as claimed in any one of 
claims 8 to 13 and wherein said specific duration determines the ramp output of the ramp generator. 

5. A desktop publishing system comprising a computer (1), a colour output device (3,7) and a rendering pro- 
cessor (10) arranged to perform rendering operations (e.g. colour fills, mattes, or mixing two images) to 
generate image data supplied to said output device. 

6. A system according to claim 15 which includes a workscreen memory (30) coupled to an image display 
apparatus (3) and a page memory (4) coupled to an image printer (7), said rendering processor (10) being 
arranged to access said memories (30, 4). 

7. Apparatus for processing image data to provide a colour image output, which comprises means for operat- 
ing upon red, green, blue, and matte or transparency information, wherein separate means are provided 
for each of the colour information and the matte information is processed by one of said colour processing 
means. 

8. A graphics engine including a grading circuit arranged to interpolate across a run of pixels from a first value 
to a second value, said grading circuit including a digital counting circuit arranged to produce a ramp of 
successive values for use on successive pixels. 

9. Image processing apparatus comprising a rendering processor (1 0) for performing colour and matte oper- 
ations on image data and, coupled thereto, a scaling processor (9) arranged to accept image data and to 
produce output image data of a different resolution corresponding thereto so as to expand or shrink por- 
tions of the image. 
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